@Nexesenex
Contributor

@Nexesenex Nexesenex commented Oct 30, 2025

Continuing my effort to bring the server up to date, here is the middleware refactor, which follows directly on the batch of 6 server PRs merged recently.

Here are the mainline PRs involved:

server : refactor middleware and /health endpoint (#9056)

  • server : refactor middleware and /health endpoint
  • move "fail_on_no_slot" to /slots
  • Update examples/server/server.cpp
  • fix server tests
  • fix CI
  • update server docs

server : fix crash when error handler dumps invalid utf-8 json (#9195)

This time, I had to resolve a number of merge conflicts, of course.

On my side it works properly, and I also tried speculative decoding without any apparent bugs. Still, it needs a review by someone more competent before it can be merged, if it is deemed pertinent.

@Nexesenex Nexesenex marked this pull request as draft October 30, 2025 14:02
@ikawrakow
Owner

I'm not very familiar with the server code, so I invite review by others.

@Nexesenex Nexesenex marked this pull request as ready for review October 30, 2025 18:16
@Nexesenex
Contributor Author

Nexesenex commented Oct 30, 2025

@saood06, @firecoperana, could either of you take a look at this PR when you have some spare time?

@firecoperana
Collaborator

I just tried this PR and don't see any errors when using the webui. This PR changes the responses returned by endpoints such as /health. It also removes the Access-Control-Allow-Origin header from all endpoints. This is good from a security perspective, but I'm not sure how it affects others. It may not be an issue, since the change was merged in mainline a long time ago.

@Nexesenex
Contributor Author

Great! One more pair of eyes, and then we're good to go!
